Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 19 November 2012 



(MN I^TeX style file v2.2) 



Applications of Integrated Photonic Spectrographs in 
Astronomy 

(N R.J. Harris^'*' and J. R. Allington-Smith^ 

^ Centre for Advanced Instrumentation, Physics Dept, Durham University, South Rd, Durham, DHl 3LE 
19 November 2012 

ABSTRACT 

One of the problems of producing instruments for Extremely Large Telescopes is 
that their size (and hence cost) scales rapidly with telescope aperture. To try to break 
this relation alternative new technologies have been proposed, such as the use of the 
Integrated Photonic Spectrograph (IPS). Due to their diffraction- limited nature the 
IPS is claimed to defeat the harsh scaling law applying to conventional instruments. 
■ In contrast to photonic applications, devices for astronomy are not usually used at 

the diffraction limit. Therefore to retain throughput and spatial information, the IPS 
requires a photonic lantern (PL) to decompose the input multimode light into single 
modes. This is then fed into either numerous Arrayed Waveguide Gratings (AWGs) 
or a conventional spectrograph. 

We investigate the potential advantage of using an IPS instead of conventional mono- 
lithic optics for a variety of capabilities represented by existing instruments on 8m 
telescopes and others planned for Extremely Large Telescopes (ELTs). To do this, 
we have constructed toy models of different versions of the IPS and calculated the 
relative instrument sizes and the number of detector pixels required. This allows us 
to quantify the relative size/cost advantage for instruments aimed at different science 
requirements. We show that a full IPS instrument is equivalent to an image-slicer. 
Image-slicing is a beneficial strategy for ELTs as previously demonstrated. However, 
the requirement to decompose the input light into individual modes imposes a redun- 
dancy in terms of the numbers of components and detector pixels in many cases which 
acts to cancel out the advantage of the small size of the photonic components. How- 
ever, there are specific applications where an IPS gives a potential advantage which 
we describe. Furthermore, the IPS approach has the potential advantage of minimis- 
ing or eliminating bulk optics. We show that AWGs fed with multiple single- mode 
inputs from a PL require relatively bulky auxiliary optics and a 2-D detector array 
which significantly increases the size of the instrument. A more attractive option is 
to combine the outputs of many AWGs so that a 1-D detector can be used to greatly 
reduce the number of detector pixels required and provide efficient adaptation to the 
curved output focal surface. 
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1 INTRODUCTION 

Spectroscopy is one of the most useful tools in astronomy, 
with applications in fields ranging from Cosmology to 
Exoplanet studies. Arguably the most common form in 
astronomy is dispersive spectroscopy which uses a dispersive 
element to separate different wavelengths. 
In its simplest form a dispersive spectrograph contains 
four components: A slit to isolate the area to be dispersed, 
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a collimator, a grating or prism to disperse the light 
and a camera and detector to record the intensity at 
each wavelength. In order to retain throughput the slit 
of the spectrograph is usually matched to the seeing of 
the telescope. If this is not diffraction-limited then the 
collimated beam must increase in size as the telescope 
grows in size in order to m aintain the same resolution 
^Lee Allington-Smithll200Ql ). The immediate consequence 
of this is that the optics and disperser must also grow in 
size. This then leads physical problems such as stresses and 
flexure in the materials, along with the difficulties inherent 
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in building these large monolithic structures. 
In order to use the same design principles as existing instru- 
ments more exotic materials and construction processes are 
needed, which drives the costs of building the instruments 
up. Using the conventional approach the cost of instruments 
scales with at least the square of the telescope aperture 
(jBland-Hawthorn k Hortonll2006l ). 

In order to reduce these problems the input to the spec- 
trograph can be sliced by dividing the input field (the slit) 
into a number of thin slices. Each of these can then be 
fed into a spectrograph or spectrographs. This technique 
is known as image slicing. Each of these spectrographs 
produces a spectrum for each slice. The resulting data 
is then reformatted into a 3D datacube with axes x,y,A 
allowing the reconstruction of the initial image. 
There are several different methods for image slicing de- 
tailed in the lit erature, from lenslet arr ays feeding fibres to 
slicine: mirrors (jAllington-Smithl l2007al ) . All have inherent 
advantages and disadvantages. The various methods have 
all been widely adopted in recent years and are used in 
various forms in instruments on the current generation of 
8m telescopes. Theoretical investigations into the scaling 
laws in relation to image slicing suggest it will be an even 
more powerful tool to reduce instrument sizes and costs 
on the next generation of 30m telescopes ( Allington-Smi t3 
l2007bh . 

In this paper we examine the potential applications of 
the Integrated Photonic Spectrograph (IPS), which shares 
some of the features of image slicers. Despite the physical 
differences they obey the same basic Physical laws as 
conventional instruments. 

The IPS devices take light from an input fibre which is 
usually matched to the seeing limit (as with conventional 
fibre fed instruments) and so supports many modes. The 
light from this multimode fibre (MMF) is then split into 
a number of single mode fibres (SMF) by a photonic 
lantern (PL) . At this po int tw o options have been proposed 
(jBland-Hawthorn et ahlboioh . 

P=l: The first requires a reform atting component 
(jBirks et al.1 l2012l : |ThomsonelaD ^1) to form a slit of 
SMFs which can then be dispersed by bulk optics. We shall 
call this the semi-photonic case. 

P=2: In the second the SMFs are then fed into Arrayed 
Waveguide Gratings (AWGs) which disperse the light into 
individual spect ra, with one or more specta per AWG 
(fBland-Hawthorn et al.l I2OIOI ). These spectra have the 
advantage that they are in a linear format, so can be 
sampled using an array of fibres or a linear detector. This 
is our fully-photonic case. 

While the principles for both have been demon strated 
(jLeon-Saval et al.ll2012l : ICvetoievic et al.ll2009l . [2oTi l. so far 

these have only used single or few modes from a single in- 
put, resulting in limitations in throughput and field. They 
are also not optimised in terms of size or the components 
used. This led us to investigate how complete instruments 
would perform compared to current instrumentation, with 
initial results suggesting that a fully-photonic IPS would be 
best suited to small diffraction-limited telescopes with small 



In this paper we determine the application areas where 
the IPS may have an advantage over conventional in- 
strumentation. After noting the formal similarity between 
image-slicing and photonic spectroscopy in section [21 we con- 
sider the requirement for the field of view of the instrument 
in section O Simplified models of the IPS are presented in 
sections |4] and [21 The results of comparing conventional and 
IPS instruments is given in section [6l In section [71 we dis- 
cuss different ways to reduce the number of AWGs and/or 
detector pixels, before presenting our conclusions in section 

m 



2 INSTRUMENT SIZE SCALE 
RELATIONSHIPS 

It is often claimed that IPS violates the relationship between 
telescope diameter and spectral resolution: 
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where R is the resolution of the instrument, m is the diffrac- 
tion order, p is the ruling density, A is the wavelength, W 
is the length of intersection between the grating and colli- 
mated beam, x is the angular slitwidth, Dt is the diameter 
of the telescope, 7 is the blaze angle, p is the ruling density 
and Dcoi the diameter of the collimated beam. 

This applies to a slit spectrograph using a diffraction 
grating as the dispersive element and shows that for a given 
resolution, angular slitwidth and blaze angle that the diam- 
eter of the collimated beam must increase in proportion to 
the telescope aperture, leading to a bigger instrument. 

Unlike a conventional spectrograph the input to the IPS 
must be diffraction- limited (A ~ X^t) due to its single mode 
nature, so the resolution can be shown to be of the form 

i?=^. (2) 

where Ny^g is the number of waveguides in the AWG 
model or number of rulings on a conventional grating and C 
a factor to account for manufacturing errors (|Lawrence et al.l 
2010 ). This has no dependence on telescope diameter so it 
would appear to break the relation. It must be noted though, 
this applies to a device operated at the diffraction limit of 
the telescope, not at the seeing limit as with equation [J 

To examine what happens when the input at the seeing 
limit we consider the n umber of sp atial modes in a conven- 
tional step-index fibre |Ched [i99Q| ) which can be approxi- 
mated as 



M : 
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It is useful to remember that each spatial mode has 
two polarisation states, though we do not include the factor 
here as each single mode fibre accepts two polarisations. The 
associated V parameter is 



(4) 



fields of view ^Harris Allington- Smithll2012l l. 



Where s is the diameter of the fibre core (assumed equal to 
the slitwidth in equation [1]) and G the numerical aperture at 
which the fibre is operated. This must be less than the lim- 
iting (i.e. maximum) numerical aperture of the fibre. Noting 
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where Ft the telescope focal ratio and /t is the telescope 
focal length, the number of modes is given by 



M 



(7) 



Therefore it can be seen that for each sampling element 
the number of modes increases as the square of the tele- 
scope diameter in a similar way to the number of slices at 
the diffraction limit (x^t/1.22A)^. This confirms that, to 
first order, photonic spectrographs are bound by the same 
scaling laws as conventional spectrographs. In what follows, 
we attempt to quantify areas where photonic spectrographs 
may confer an advantage, and suggest modifications which 
may allow photonic spectrographs to make a significant im- 
pact on future astronomical instrumentation. 



3 THE INPUT FIELD AND SPATIAL 
MULTIPLEX 

In the previous section we calculated the number of modes 
per spatial sa mpling element (spaxel). As with d iverse field 
spectroscopy urrav Anington-SmithI [2009), photonic 
spectrographs address a number of individual spaxels, which 
can be grouped (as in Integral Field Spectroscopy; IFS), or 
separate (as in Multi Object Spectroscopy; MOS). In or- 
der to fairly compare with conventional instrumentation we 
need to make sure we sample the same number of spaxels 
(e.g. observe the same field). 

A long slit can be thought of as a series of spaxels joined 
to form a rectangle of size 1 x A/", where N is the total number 
of spaxels. In IFS, the field is equivalent to a series of slits 
(each composed of linked spaxels) joined so the total number 
of spaxels \s N — NxNy, where Nx is the number of spaxels 
in the x direction and Ny in the y direction. MOS can be 
thought of as the same number of spaxels, distributed 
throughout the field of the telescope and brought together 
to form a long slit. See Fig. [2] for an illustration of this. 

An important consideration is the sampling of the field. 
From Fig.[T]and equation[71it can be seen that the number of 
modes produced is dependent on the overall size of the field, 
not the individual spaxel size (as the number of modes per 
spaxel is proportional to the square of the spaxel size) . This 
means the number of components for the IPS (and hence 
the approximate size of the instrument) required for the in- 
strument will not depend on the sampling scale. 
However, the amount of spatial information and through- 
put will depend on the sampling scale. Although it might 
appear best to reduce the spaxel size and the number 
of modes, this will reduce the coupling efficiency and 
throughput ( Corbett Allingt on-Smithll2Q06l ). At the other 
extreme the use of very large fibres would result in loss 
of spatial information. A balance must be found between 
throughput and spatial resolution. We do not investigate 
this fully here as it does not affect the total number of modes 
in the field or the required number of detector pixels. Thus 
we choose to make our spaxel size equal to the FWHM of 
the seeing. 
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Figure 1. An example of the number of modes generated from 
a single spaxel on an 8m telescope of 0.5" FWHM seeing at A 
= 1650nm. Fixing the size of the spaxel to the FWHM gives a 
single spaxel (here number of slices = 1), this spaxel is large and 
contains many modes. Slicing the spaxel produces smaller spaxel 
sizes, but larger numbers of them (the number of spaxels is the 
slices squared). This results in the same total number of modes in 
the area (the horizontal yellow line). The variation in the blue is 
due to the integerisation of modes within individual slices. Note 
that the three final red points lie under the blue ones. 



In order to calculate the scale length of the instrument 
we take the cube root of the volume of the instrument. We 
can calculate this from the number of spaxels in the total 
field passed to the spectrograph 



scale length = ^/NM{Xn 



^ LxLyLz 



(8) 



Where M is a function of the shortest wavelength in 
the spectrograph (Am^n), in order to account for all spatial 
modes. L^, Ly and Lz are the lengths of an individual com- 
ponent spectrographs or AWGs in the x, y and z direction 
respectively which will be defined in the next two sections. 
P—1 and P—2 represent the semi-photonic and photonic 
cases respectively. 



4 THE SEMI-PHOTONIC IPS (P=l) 

The semi-photonic case involves taking an individual 
spaxel and using a re-arranged PL to form a diffraction- 
limited slit, which is then dispersed by bulk optics 
(jBland-Hawthorn et al.ll2Qlol ). 



4.1 Model Geometry 

Slicing the input of a s pectrograph has already been ex- 
amined theoretically in (jAllington-Smithl l2007bh . The pa- 
per took existing instruments and sliced the input, either 
adding the slices to the length of the slit, or placing them 
into replica spectrographs. It showed that slicing could re- 
sult in an instrument with a slightly smaller overall volume, 
though the instruments sliced to the diffraction-limit were 
shown to be larger than their counterparts due to the extra 
components required. This is important to us as we showed 
that photonic spectroscopy is similar to image slicing to the 
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Figure 2. An illustration of conventional slicing and Photonic slicing. All methods sample an area of the same size (e.g. the same number 
of spaxels). The three conventional methods reformat the input and disperse it, producing one spectrum per spaxel. The fully-photonic 
option takes the each input spaxel and splits it into individual modes using a photonic lantern. Each of these modes is fed into an AWG 
to produce a spectrum. These then need to be recombined and summed to produce the spectrum for the spaxel. The semi-photonic 
option uses the same photonic lantern, but this is then reformatted into a long slit and fed into conventional spectrographs. 



diffraction limit in section O 

As conventional image slicing has already been examined we 
restrict ourselves to examining only the IPS concept, which 
takes each individual spaxel (not a number of them) and 
separates it into a single spectrograph. 

We wi ll be using the modified model from (lAllington-SmithI 
l2007bl ) described in ([Harris k Allington-SmithI l2012h and 
adding this to our results. As the input to each spectro- 
graph now depends on the number of modes per spaxel we 
shall be setting the length of the slit to the number of modes 
(equation [7]) instead of Uy in the previous papers. 



4.2 Semi-photonic model limits and Calibration 

In order to calibrate the m odel we use the same method 
as ( Alling ton-SmithI l2007bh , with the S=l case oversizing 
using a multiplicative factor and the S=2 case oversizing 
the spectrograph input beam. The scaling factors for our 
instruments can be found in Table [T] 



5 THE FULLY-PHOTONIC IPS (P=2) 

For our fully-photonic model we shall concentrate on mod- 
elling the size of the AWGs, not the components that feed 
them. We shall include a factor for our detector sizes. 
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Table 1. The scaling parameters for the semi-photonic versions of 
the conv entional instruments. Th e scali ng scenarios are described 
fully in (|Allington-Smithl l2007bl ) and (|Harris Allington-SmithI 
l2012l V 



Instrument 


S 


=1 


s = 


:2 




a (m) 


b 


a (m) 


b 


GNIRS 


0.1 


2.1 


0.46 


1.1 


CRIRES 


0.1 


2.2 


0.61 


1.1 


NIFS (J) 


0.1 


7.0 


0.86 


1.1 


SINFONI (H) 


0.1 


7.0 


0.86 


1.1 


IRMS 


0.1 


8.0 


2.06 


1.1 


IRIS 


0.1 


10.0 


1.46 


1.1 



5.1 Model geometry 

For the fully-photonic model we first need to consider the 
geometry of AWGs. These are available in many different 
variations, especially with respect to the geometry which 
generates the path difference between waveguides (e.g. S- 
bend, circular, horse-shoe). To kee p the toy model simple 
we have ch osen a reflective AWG ('Grave de Peralt a et al.l 
using a Rowlands Circle arrangement for the 
Free Propagation Region (FPR). We have removed the bend 
at the end of the waveguide array for simplicity. Because of 
this it looks almost identical to a conventional double-pass 
Echelle spectrograph. 

Using the definitions in Fig. [3] we arrive at the following 
equations for the size of the AWG model 



Lx = (max(D, E) + aawg) bawg 

Lz = {Uamg + A+ {N-ujg - l)AL)bawg- 



(9) 
(10) 

(11) 



Here A is the x-length of the FPR, AL the length dif- 
ference between adjacent waveguides to achieve the required 
order for a given central wavelength (Ac),Z) is the length con- 
taining the waveguides (analogous to the illuminated length 
of the grating in a standard echelle grating) , E the x length 
of the detecting surface, w is the waveguide diameter. The 
oversizing parameters aawg, bawg and Cawg parameterise the 
extra size required to implement a practical device. 

First we calculate the appropriate dispersion order, m, 
in terms of the free spectral range (FSR, AXfsr) for an 
AWG 



A71 



(12) 



Setting D = Nwg/p, this can be combined with equation 
[2] where p is the density of waveguides (analoguous to the 
ruling density of a conventional disperser) to give 



D 



CR 
mp ' 



(13) 



The physical extent of the FSR in an arrayed waveguide 
grating is Xfsr = {^minLFp/ris), where Us is the refrac- 
tive index of the slab and Lf the length of the free space 
propagation region. Combining with geometrical arguments 
gives 



Lf sm — — Lf sm . 

V 2 y \ Us J 



(14) 



Where ^ is defined in Fig. O In order to calculate AL 
we make use of the equation for calculating the central wave- 
length of the AWG 

XcTTl 
Tie 



AL 



(15) 



where ric is the refractive index of the waveguides, the cen- 
tral operating wavelength is Ac =Amin + AAfsr/S, so A can 
be calculated from geometry as 



A — Lf cos(^) — Lf cos 



2pL^ 



(16) 



where is defined in the figure and Nwg is calculated 
using equation[2l In order to calculate Lf we make use of the 
fact the imaging requires the number of detector pixels to be 
able to adequately sample at the resolution required (eq uiv- 
alent t o sampling of the Echelle model in (Allington-SmithI 
l2007bl ). To do this we take the dispersion relation 



\6x J \dx J 



(17) 



LfTTip' 

and combine it with equation [TJl setting Sx = Nodp 
where A^o is the oversampling and dp is the size of the pixels 
We also take the equation for the spectral resolution ^A = 
Xmin/R- Minimising to obtain the maximum Lf we find 

risNodpRAXFSR 



Lf ^ 



pX 



2 

min 



(18) 



Finally we can calculate the number of pixels we need 
for the required resolution 



Np = 



Lfsr _ XcLfP 
Nodp Us dp 



(19) 



5.2 Fully photonic model Limitations and 
Calibration 

Astronomical spectrographs are usually designed to operate 
with a large free spectral range (typically several hundred 
nm). This is a problem for the IPS because conventional 
telecoms AWGs are designed with low free spectral range in 
order to deal with the discrete narrow band input from the 
telecoms industry. For astronomy, single AWGs need to be 
redesigned to work in lower spectral orders by reducing the 
path difference between adjacent waveguides. This requires 
more waveguides to maintain the maximum theoretical 
resolution and an increase in Lf to maintain a practical 
one (see ean fTS)) . This produces very large AWG dimensions 
which cannot b e manufactured due t o chip manufacturing 
size constraints ([Lawrence et al.ll2010l V 

We wish to avoid this problem and retain a fully inte- 
grated design with no external optics. As such we make use 
of the tandem AWG arrangement, where a primary AWG 
filters the light by wavelength into secondary AWGs, each 
encompassing a fraction of the original FSR (see Fig. 2] and 
(Takada et al. 2001)). This allows the individual component 
dimensions to be within manufacturing limits whilst allow- 
ing our full device to sample the correct FSR. It would also 
allow the AWG design process to remain similar to current 
specifications. 

Since no fullscale AWG instruments currently exist, 
we are required to use a bottoms-up approach to estimate 
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Figure 3. The fully-photonic model. The left image shows the x-y view of the AWG, with the (left to right) input fibre, Free Propagation 
region and waveguides. The top right image shows the side view of the AWG model, with the top and bottom cladding layers and the layer 
containing the waveguides in the centre. The bottom right image is an enlargement of the FPR which is a Rowlands Circle arrangement. 



its size. To do so we take an existing AWG acquired from 
Gemfire Livingstone. We use its known parameters and 
adjust the models produced dime nsions until they matc h 
the real ones. In order to emulate (i Allington- Smith! l2007b[ ) , 
we set the scaling parameters to two extremes. 

S=l : Minimise b aw g = 1-1, which yields a aawg — 10mm 
S=2 : Minimise aawg — 0mm, which yields bawg — 2.8 

For both scenarios we keep Cawg — 0.7mm as the device 
is planar so the height should not change. 

For simplicity we will not include the volume of the 
initial multimode fibre bundle, the photonic lantern or the 
housing of the instrument. We will however include a esti- 
mate for the size of the detector. This value is calculated 
assuming the size of a typical detector sub-system including 
the cryostat. This is estimated as lO"^??!'^ per detector pixel. 




Figure 4. An illustration of the tandem and single AWG setup. 
The image on the left shows the conventional AWG dispersing the 
whole spectrum. The image on the right is the tandem configu- 
ration. The intial AWG (or other disperive optic) splits the light 
by wavelength (here to /Wpsn/'^) and feeds the second set of 
AWGs. This has the advantage that each individual AWG can be 
smaller, though it requires more AWGs, additional components 
(feeding fibres) and is subject to extra loss of throughput. Note 
that the length of the output must be the same in both cases in 
order for the detector to sample adequately. 



6 RESULTS 

To estimate the uncertainties within our model, we use both 
of our oversizing options for both models and also vary C 
between 1 (diffraction-limited) a nd 4 (the initial results ob- 
tained in (|Cvetoievic et ahlbOO Q')) for our fully-photonic in- 
strument. This gives us two extremes a nd allows for the 
curre nt maximum of C =1.6 achieved in (|Cvetoievic et all 
I2OI2I I. 

To test our models we choose two sets of instruments. 
First those designed to represent current instrumentation 
on 8m telescopes. Then we test against instruments that 
have been designed for the TMT. We shall only investigate 



a single mode of operation for each instrument. This may 
be oversimplifying as most instruments are designed to 
operate at different resolutions and at different wavelengths 
by changing gratings or optics. It however keeps the model 
simple and IPS devices could be adapted to suit various 
purposes, this is discussed in the conclusions. 

The first set of instruments are in current use on 8m 
telescopes and are intended to represent generic instrument 
types. We use the parameters in Table [2] to calculate the 
volume of the instruments. The instruments we have chosen 
are: 
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(i) Gemini Near InfraRed Spectrograph (GNIRS) on 
Gemini-North. The instrument has an overall wavelength 
range of 1.0-5.4/im, resolutions of between 1,700 and 18,000 
and slit widths of between 0.1 and 1.0 arcseconds. It has 
an imaging mode, a long slit mode and originally an IFS 
mode (destroyed during maintenance at the telescope). We 
will be comparing our photonic instrument to the long slit 
configuration. 

(ii) CRyogenic high-resolution InfraRed Echelle Spec- 
trograph (CRIRES) is a high resolution spectrograph 
on the Very Large Telescope (VLT). It is designed to 
operate between 1.0-5/im, with a resolution of up to 10^. 
We have chosen it to illustrate a long slit high resolution 
spectrograph. 

(iii) Near-Infrared Integral Field Spectrometer (NIPS) 
on Gemini-North is our first IFU instrument. It is designed 
to work with the Adaptive Optics system, over 0.9-2.4 /jm. 

(iv) Spectrograph for INtegral Field Observations in the 
Near Infrared (SINFONI) on the VLT is our second IFU 
instrument. It operates in the 1.1 to 2.45/im range again 
with AO. 



We have also chosen two h ybrid instruments p roposed 
for the Thirty Meter Telescope (|Simard et al.ll2010l ). IRMS 
will employ 10 or more IFUs. Each one will have a 2 x 2 
arcsecond squared field of view with a 50% of the energy 
enclosed by 50 mas at wavelength l/xm, resulting in 1600 
spaxels per IFU. IRIS has 3 IFU units, two of which will 
be lenslet arrays (for observing smaller fields) and one will 
be an image sheer (larger fields). Here we model the sheer, 
which has 88 mirror facets, but will keep the best resolution 
possible with our AWG model. As with the 8m instruments 
, the instrument scale lengths are fitted to values taken 
from the literature, see Table O 

Table [3] shows the resulting parameters in the fully- 
photonic case. The total number of AWGs required are 
shown in the second column, this number will be in the 
tens of thousands for 8m instruments and the hundreds of 
thousands for the 30m instruments. The large number of 
AWGs requires rigorous quality control to test the large 
number of individual components. This may be of advan- 
tage though, as the individual AWGs should be less prone 
to flexure and, due to their modular nature, are better 
suited to mass production and upgrades and expansion to 
suit cashflow. Note that the size predictions do not include 
provision for mounting hardware required to support the 
instrument components or to provide a suitable controlled 
environment. 

The next eight columns show the different resulting nor- 
malised scale lengths of the instrument. Note that this is 
the scale length of the overall instrument, not the individual 
components. The first four are the scale lengths without 
provision for the detector size and show that the total 
size of the 8m instruments will be on the same order as 
the conventional instrument. If the diffraction limit can 
be achieved the resulting instruments are smaller for all 
scenarios, with the exception being NIFS with the S—2 



scaling. If the diffraction-limit cannot be achieved the 
instruments will have a scale length larger than the original 
instruments. The results for the 30m instruments are 
similar to the 8m ones, with the 5'=1 scaled case of IRIS 
being slightly smaller and the rest being slightly larger. 
The second four include the provision for detector and 
show similar results, though the scale lengths are increased 
slightly as expected. This shows though the size of the 
additional detector pixels (discussed later) will not pose a 
significant size restriction on the instrument. 

Table 2] shows the results corresponding to the number 
of modes per spaxel and hence the requirements in terms 
of detector pixels. The second column shows that all of the 
instruments will have around 30 modes per spaxel, with the 
exception of NIFS, which will have 10. This causes prob- 
lems with oversampling in the fully-photonic model due to 
each mode needing to be sampled using two detector pix- 
els per resolution element (Nyquist sampling). There needs 
to be some way of combining the individual spectra to stop 
massive oversampling (shown in column 3) . This will be dis- 
cussed later, but will probably involve additional compo- 
nents, increasing the size of the instrument. 

Table [5]shows the results from the semi-photonic model. 
The second column shows the number of replica photonic in- 
struments will be 10^- lO'^. As such using the semi-photonic 
method will require mass production of the replica spectro- 
graphs which is not common in the astronomical instrumen- 
tation community. The alternative is to put many spaxels in 
the same spectrograph and would require a balance between 
redundancy and overlarge components for this version is to 
work. 

The next two columns show that using the semi-photonic 
model for the instrument will result in m uch l arger ins tru- 
ments . This matches with the results of (|Allington-Smithl 
l2007b ). where as the input was sliced more the instrument 
tended to get bigger. As stated in section 2] we are slicing 
the instrument to the diffraction-limit in the spatial direc- 
tion and then separating each spaxel into a separate spec- 
trograph, which imposes huge redundancies. The number of 
required detector pixels are shown in the final column for 
this we are assuming that all the detector separate modes 
can be combined onto a linear detector, which may not be 
possible. 



7 MODIFICATIONS TO INTEGRATED 
PHOTONIC SPECTROGRAPHS 

From the results already discussed, it is clear that IPSs in 
their current state offer little or no advantage in terms of 
size and detector pixels when compared with existing in- 
struments on large telescopes or those planned for ELTs. 
However it is possible to envisage modifications to the fully- 
photonic device which would make it possible to exploit the 
unique features of photo nic spectrographs. One such scheme 
is already being studied (|Cvetoievic et al.ll2012l V 

7.1 Multiple-input Arrayed Waveguide Gratings 

So far we have restricted ourselves to one input per AWG 
(i.e. one mode per AWG), which as shown in previous sec- 
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Table 2. Table of input parameters all instruments. Symbols except for Njpjj ■> the number of IF Us in the 
instru ment, are expl ained in the text. The num bers are taken (a nd approximated from) ( Allingjto n-Smithl 
l2Q07bl ) for GN IRS. (lKaeuflll2004l) for CRIRES, (iMcGregor et al.|[20 03) for NIFS, dEisenhauer et al.ll2003l ) 
for SINFONI, (lEikenberrv et al.ll2006r) for IRMS and (iLarkin et alBoiQ ) for IRIS. 



Instrument 


X 




Total Spaxels 


R 


Ac 


^FSR 


P 


Vol 




(") 








(nm) 


(nm) 




(m3) 


GNIRS 


0.3 


330(1) 


330 


5,900 


1,650 


400 


31.7 


2.00 


CRIRES 


0.3 


200(1) 


200 


100,000 


1,650 


48 


31.6 


3.00 


NIFS (J) 


0.1 


30(29) 


870 


6,050 


1,250 


600 


600 


2.75 


SINFONI (H) 


0.2 


32(32) 


1,024 


3,000 


1,650 


400 


128.57 


2.75 


IRMS 


0.1 


40 (40) [10] 


16,000 


10,000 


1,200 


400 


128.57 


16.00 


IRIS 


0.1 


60(60) [1] 


4,000 


8,000 


1,200 


400 


310 


55.00 



Table 3. Table of the resulting scale lengths of the respective fully-photonic instruments. The total number 
of modes in the whole instrument is shown first, with the next four columns showing the respective sizes 
for the model with no detector. This is followed by the model with detector. All of the scale lengths are 
normalised to the cube root of the volume in Table [21 The AWG model uses a waveguide separation of p = 
200mm- ^ 



Normalised Scale length 





Total 




No detector 






With detector 




Instrument 


number 


C= 


:1 




=4 


C-- 


=1 








of AWGs 


S=l 


S=2 


S=l 


S=2 


S=l 


S=2 


S=l 


S=2 


GNIRS 


13,000 


0.49 


0.87 


1.46 


1.55 


0.51 


0.89 


1.47 


1.55 


CRIRES 


6,300 


0.50 


0.90 


1.19 


1.32 


0.60 


1.09 


1.22 


1.42 


NIFS 


8,900 


0.79 


1.45 


1.66 


1.93 


0.80 


1.46 


1.66 


1.93 


SINFONI 


28,000 


0.39 


0.64 


1.36 


1.39 


0.39 


0.66 


1.36 


1.39 


IRMOS 


520,000 


1.69 


3.08 


3.53 


4.09 


1.71 


3.11 


3.53 


4.11 


IRIS 


110,000 


0.72 


1.07 


1.34 


1.48 


0.73 


1.08 


1.35 


1.49 



Table 4. Further information on the fully-photonic model. The AWG model uses a waveguide separation of 
p = 200mm- 1. 



Instrument 


Modes 


Total detector 


Reference Instrument 




per spaxel 


pixels /(lO^) 


pixels /(lO^) 


GNIRS 


39 


60.69 


1.05 


CRIRES 


31 


47.31 


2.10 


NIFS 


10 


112.43 


4.19 


SINFONI 


27 


66.50 


4.19 


IRMOS 


32 


6263.44 


83.89 


IRIS 


32 


1127.42 


16.78 



Table 5. Results for the semi-photonic model. All of the scale lengths are normalised to the cube root of 
the volume in Table [21 The semi-photonic model uses p stated in Fig. [21 * The number of detector pixels 
assumes that the modes in each spaxel can be reduced onto the detector appropriately, which may not be 
the case. 





Number 


Normalised Scale 


Number of 


Instrument 


of replica 


Length 


detector 




Spectrographs 


S = 1 


S = 2 


pixels* /(lO^) 


GNIRS 


330 


4.30 


5.30 


1.05 


CRIRES 


200 


3.06 


4.66 


2.10 


NIFS 


870 


9.55 


9.54 


4.19 


SINFONI 


1,024 


8.62 


9.36 


4.19 


IRMOS 


16,000 


26.07 


32.48 


83.89 


IRIS 


3.600 


8.49 


9.20 


16.78 
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Figure 5. Cutaway diagram illustrating the difference between 
the single and multiple input versions of the AWG. To the left 
is the single input, which would make use of a linear detector 
array to sample the output spectrum. To the right the multiple 
input version. Here five inputs produce five separate spectra that 
overlap at the output of the AWG. This has a couple of implica- 
tions, first the spectra would need to be cross dispersed in order 
to be sampled and secondly the end of the second FPR would 
need to be larger (though not the input which would remain the 
size of the waveguides). The path difference in the waveguides is 
not illustrated in this diagram for simplicity. 



tions requires many AWGs. Placing multiple inputs per 
AWG would reduce the dependence on equation [71 at the 
extreme eliminating it entirely if all the inputs could fit into 
a single AWG. In order to introduce these extra inputs ad- 
ditional fibres are placed at different positions on the input 
FPR. This introduces a path difference between each input 
with respect to the central one (see Fig. [5] and [6|). This path 
difference is carried through the system and results in the 
output spectrum of each mode being shifted in the disper- 
sion direction at the output. To remove the overlap between 
spectra, it will be necessary to introduce cross dispersion. 

The AWG also produces multiple diffraction orders (as 
with a conventional grating) so we also need to make sure 
the inputs all lie within a region half that of the FSR of the 
central input. This is to stop the same wavelength from dif- 
ferent orders lying in the same position in the linear output, 
resulting in the cross dispersed spectra lying the same po- 
sition on the detector. The refractive index change in fused 
Silica is not great enough to disperse the light by polishing 
the AWG at an angle. In order to separate the spectra, the 
outp uts needs to be c ross-dispersed using conventional op- 
tics (|Cvetoievic et al.L200 9) . This means that each AWG 
will need a two-dimensional detector, a dispersive element 
and collimating and camera lenses. Here we look at the rel- 
ative advantages of using cross dispersion in the system. 



7.2 Adding cross-dispersion 

To cross-disperse, we need additional optics, which means 
that the device is no longer fully integrated, potentially 
making manufacture and maintenance more difficult, but 
reducing the number of AWGs required for the device. We 
construct a new toy model to see how the scale length of 
a cross dispersed system (multiple inputs) compares to 
one with linear arrays (single inputs). For this section we 
have modified our fully-photonic model so it is no longer 
reflective and the outp ut of the second FPR outputs is 
linear (e.g. Fig. [5] and (|Lu et al.| [20Q3\). The first FPR is 
still in its original shape so as to allow the multiple inputs. 
Changing the model like this will affect the overall size of 
the instrument (due to the difference in AWG design), but 
will still allow us to examine the relative sizes of the two 
scenarios. 



We retain the single input model for our comparison 
and use the length values calculated in previous sections. 
As such the scale length of the instrument (with all modes) 
is still NawgLxLyLz, where Nawg is the number of AWGs. 

The calculation of the size of the multiple-input AWG 
option follows that for the The cross dispersion option re- 
quires the AWG, a collimator, prism and then camera in 
front of the detector (see Fig. [7|). The equation for the vol- 
ume of the system now becomes NawgLxLyLz, where Nawg 
= N/Ni, the total number of spaxels divided by the number 
of inputs per AWG and the dimensions being defined below. 

We start by examining the output end of the AWG. For 
a single input the x length of the AWG system would be the 
same as described in equation [9l with E will now be Xfsr as 
we have flattened out the output. Adding extra inputs such 
as the ones illustrated in Fig. [7] will increase this x length. 
The maximum distance between inputs must be less than 
Xfsr, to avoid the same wavelength in a different orders 
lying on the same position. For simplicity we assume evenly 
spaced inputs, which when combined with the previous con- 
dition yields equation 1261 We set the maximum number of 
inputs to be Xfsr / Dinput, where Dinput is the diameter 
of the input fibre (here set to 125/im). 

To calculate the y length, we must consider how the spectra 
are to be cross dispersed. We need to make sure that the 
output beam from the system is collimated. To do this we 
make sure the output angle of the collimator is smal: O2 = 
w/ 2fc < 0.01°. where O2 is the divergence in the collimator 
and fc is the focal length of our collimator. As our system is 
diffraction- limited, the diameter of our collimated beam is 



D = Oifc 



(20) 



For cross-dispersion we use a prism, athough a grating 
could also be used. We need to work out the required res- 
olution of the prism, which is proportional to the number 
of inputs (e.g. as the number of inputs increases the FSR 
decreases by that factor) . This gives 



Rx 



(21) 



where Rx is the resolution of the cross dispersed system. We 
can then combine this with the equat ion for the resolving 
power of a diffraction-limited prism (|Fov k Fovl I2OO2I ) to 
yield 



\dnj KAXfsr/ \dnj 



(22) 



where t is the path difference between the upper and lower 
rays in the prism. In order to account for all wavelengths 
we must use the maximum value of the material dispersion, 
dX/dn within the required wavelength range). The vertex 
angle of the prism is 



a — arcsm 



int) 



(23) 



where a is the angle the prism makes to the collimated beam 
and n is the refractive index of the prism. This allows the 
calculation of the vertex angle of the prism 



2 arcsin ( sin a 
\2D 



The output angle of the prism 



(24) 
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L 
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/// 
/// 
/// 
/// 

2D Detector 

Figure 6. The multi-input model for the cross dispersed system. Each spaxel from the input field is fed into a photonic lantern. The 
output SMFs are fed into AWGs, with multiple fibres in each AWG. The output from these AWGs is then cross dispersed onto a 2d 
detector. 



Photonic 
Lantern 



cross 
dispersion 



/3 = TT- 



■ 2a. 



(25) 



We can then calculate Ly from Fig. [71 giving equation 1271 
Finally from above and from the geometry in Fig. [71 we have 

= MAX (^Xfsr ( ^^^~^ ) ' ^) + 2aa^^ (26) 

Ly = y +tsin^ + asin/3 + MAX (^/cSin/3, (27) 
L, = L^ + /eoz+a + tcos^+(a + /c)cos/3. (28) 



7.3 Results for cross-dispersed multiple-input 
AWG 

We now run the simulation for all the instruments detailed 
in section [6l using the model above and fused silica as the 
glass in our prism. We also set the maximum value of t to 
30cm, to represent sensible limits for the prism size. 

By imposing our limit on t we can see in Fig. 17.31 that 
the number of inputs per chip is limited to the tens for all 
the resulting graphs due to equation [22l The potential ad- 
vantage of this is all the resulting modes from a single spaxel 
could be fed into one AWG, meaning each one could be iso- 
lated. 

We can also see that though the instrument size decreases 
(particularly for NIFS, IRMS and IRIS) as more inputs are 
added all instrument sizes will be much larger than the single 
input version. Existing results have only put around 10 in- 
puts on a chip and then cross dispersed by the IRIS2 instru- 
ment (Cvetoievic et al. 2012), which fits with out results. 
There is no result for CRIRES as the prism would have to 
be too large to have sufficient resolution. 

Not shown in the resulting graphs are the numbers of 
pixels required for the instruments, which would be of the 
same order or greater for this new setup. 

It should be noted that we have used a prism in our 
example, which is usually used for lower resolution cross 
dispersion. The alternative is to use a grating, though this 
would work in a similar way. Taking the equation for FSR 
and combining it with a diffraction-limited grating [Rx — 
mpW) yields 



Ni = pW. 



(29) 



Showing the grating size (related to W) will increase as the 
number of inputs increases (given a maximum ruling den- 
sity). 



7.4 Other instrument options 

In its present form it is clear that trying to compete with 
large IFU st yle instruments is not a viable option. As shown 
in (Harris Allington- Smithll2012l l the areas providing the 
greatest advantage would be small or diffraction-limited tele- 
scopes, preferably operating at longer wavelengths with in- 
struments that only require a small field of view. There is 
potential for applications in solar system science, planetary 
and stellar science and studies of individual stars in galaxy 
populations 

Another option is to use multiple single-input AWGs, 
but to combine the output onto a single linear detector ar- 
ray to reduce the number of detector pixels by a factor equal 
to the number of modes per spaxel. This would greatly re- 
duce the cost of the detector system and bring the benefit 
of adaptability of a one-dimensional detector array to the 
output focal surface of the AWG. This would only be pos- 
sible if the pixels had a large aspect ratio. This might incur 
a penalty in terms of extra detector noise; and the num- 
ber of AWGs is not reduced. This option is currently under 
investigation. 

A further option is to reduce the number of modes that 
are extracted from the input multimode light to produce an 
acceptable tradeoff between cost and performance defined as 
a combination of throughput, spectral resolution and field 
size. Options include.. 

(a) Restrict the number of single-mode fibres out- 
put from the photonic lantern with a consequent loss of 
throughput. This may be acceptable because the pop- 
ulatio n of exc ited modes is not likely to be uniform 
(C orbett &: Allin gton-Smith 2006) but will reduce as a func- 
tion of mode number to a cutoff value at high order. Thus 
the overall performance of the system in terms of the prod- 
uct of cost and throughput may be acceptably high. 

(b) Reduce the number of AWGs (and detector pixels) 
by making each work in a partly multimode (i.e. few mode) 
configuration so that the AWG disperses light which is not 
in a single mode. This may be acceptable if high resolving 
power is not required e,g, in a survey of faint, unresolved 
galaxies. 

(c) Reduce the number of AWGs (and detector pix- 
els) by decreasing the field of view. This directly trades- 
off cost with field coverage. This is of relevance to appli- 
cations requiring little spatial multiplex, e.g. single-object 
spectroscopy or spectroscopy with high-order adaptive op- 
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tics s uch that the input im age is aheady near the diffraction 
hmit ([Harris k AningtQn-Smith,2Q12, ). 



8 CONCLUSION 

We have examined the apphcation of Integrated Photonic 
Spectrographs in astronomy and shown that an IPS is 
equivalent in function to an image sheer. We have shown 
that as the telescope diameter increases, the size of an IPS 
must also increase (provided that the slit is not matched to 
the diffraction-limit) due to the increase in number of modes 
in the field (equivalent to the number of diffraction-limited 
slices). We have also shown that the number of modes in 
a field is independent of how the field is initially sampled 
(the size of the sampling element (spaxel) has no effect on 
the total number of modes in the field). 

We modelled IPS instruments to compare them with 
conventional instruments on large telescopes and found that 
they require 10^-10^ Arrayed Waveguide Gratings (AWGs) 
or 10^- IC^ replica spectrographs if bulk optics are used 
for instruments on 8m and 30m telescopes. We found that 



fully-photonic instruments were comparable in size to their 
conventional counterparts but only if the AWG was close to 
the diffraction limit. The semi-photonic instruments were 
found to be much larger, due to the redundancies of having 
multiple spectrographs. 

We have also found that unless the input image is sampled 
near the diffraction limit, the number of component spectra 
in each spaxel is very high, requiring large numbers of pixels 
in the detector array. This is equivalent to oversampling 
the PSF and could also potentially increase detector noise 
in the instrument. 

To combat the problem of size we considered the effect 
of adding extra inputs to the AWG to reduce the number 
of AWGs required. However the resulting instrument was of 
the same size or larger. It also means that the problem of 
oversampling in the linear case remains unsolved and poten- 
tially will be worse since the spectra will need to have gaps 
between them to distinguish them. 

We also examined other options for reducing the num- 
ber of detector pixels and/or AWGs and concluded that in- 
struments of photonic construction may be viable depend- 
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Figure 8. The resulting scale length due to varying the number of inputs to each AWG on each instrument. The different scaling cases 
are show in green and blue with the dashed horizontal line indicating the scale length of the single input instrument. From the figure you 
can see that all the results will produce larger instruments than the single input case. The result for CRIRES is omitted as no sensibly 
sized prism could be found with sufficient resolution to cross disperse the outputs. 



ing on the extent to which performance (including through- 
put, spectral resolution and spatial multiplex) can be traded 
against cost. 

Even without these modifications or restrictions, there 
are some areas where IPSs may offer a significant advantage. 
These include spectroscopy of objects near the diffraction- 



limit, e.g. single objects with high-order AO such as in exo- 
planet studies. Another is low-resolution multiplexed spec- 
troscopy working in the few- mode limit. 
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